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We derive upper bounds on the rate of transmission of classical information over quantum chan- 
nels by block codes with a given blocklength and error probability, for both entanglement-assisted 
and unassisted codes, in terms of a unifying framework of quantum hypothesis testing with re- 
stricted measurements. Our bounds do not depend on any special property of the channel (such as 
memorylessness) and generalise both a classical converse of Polyanskiy, Poor, and Verdu as well as 
a quantum converse of Renner and Wang, and have a number of desirable properties. In particular 
our bound on entanglement-assisted codes is a semidefinite program and for memoryless channels its 
large blocklength limit is the well known formula for entanglement-assisted capacity due to Bennett, 
Shor, Smolin and Thapliyal. 



I. INTRODUCTION 

This work is concerned with the transmission of classi- 
cal information over quantum channels by means of block 
codes. This is a central subject of study in quantum 
information theory, and the asymptotic rates of trans- 
mission for various types of code and channel in the 
large blocklength limit are the subject of celebrated theo- 
rems and intriguing open problems. A more fundamental 
problem, of both theoretical and practical interest, is to 
obtain upper (or converse) and lower (or achievability) 
bounds on the optimal transmission rate for a given er- 
ror probability e and finite blocklength n. As Figure [T] 
illustrates, finite blocklength effects can be substantial. 

Without assumptions on the structure of the opera- 
tion implemented by n channel uses (e.g. independence), 
there is only a notational difference between coding for 
n uses of a channel and coding for one use of a larger, 
composite channel, and so bounds which apply in this 
scenario are also known as 'one-shot' bounds. These are 
the subject of a number of recent results in quantum in- 
formation 1 -5 and remain an active topic of research in 
classical information [5J [7] . All bounds referred to in the 
remainder of this introduction are of this type. 

Mosonyi and Datta [T], Wang and Renner [5| and 
Renes and Renner [3] have given converse and achievabil- 
ity bounds for classical-quantum channels. These can be 
applied to unassisted coding over general quantum chan- 
nels by maximising the bound for the classical-quantum 
channel induced by a particular choice of encoding over 
all such choices. In [3] Datta and Hsieh derive converse 
and achievability results for entanglement-assisted coding 
over quantum channels in terms of smoothed min- and 
max-entropies. 

The present work was inspired by a converse for classi- 
cal channels (hereafter the 'PPV converse') given in a pa- 
per by Polyanskiy, Poor and Verdu [5] and further inves- 
tigated by Polyanskiy [7j and Matthews [8]. We provide 
converses for both entanglement-assisted and unassisted 
coding over quantum channels, in terms of a quantum 



hypothesis testing problem (illustrated in Fig. [3]) with 
restrictions on the measurements for the unassisted case. 

This provides a unifying framework which not only in- 
cludes the the PPV converse (for channels with finite 
input/output alphabets) but also the existing converse 
of Wang and Renner [2] (as applied to general quantum 
channels) as a special case. 

Our converse for entanglement-assisted codes has some 
good properties. For a fixed blocklength, the converse of 
Datta and Hsieh [3] is unbounded as e — > whereas our 
bound is a decreasing function of e. It has a formulation 
as a semidefinite program, and its computation can be 
simplified (sometimes greatly) by using any symmetries 
the channel may possess. As an example, we show how 
to compute our bound exactly for n uses of a depolarising 
channel (see Section III and Fig. [lj. 

We give more than one converse for unassisted codes, 
which have various advantages and disadvantages. The 
next subsection describes the channel coding problem 
and gives a summary of our results. 




FIG. 1: Our upper bound Q on the rate of entanglement- 
assisted codes evaluated for three different error probabilities 
e, for the qubit depolarising channel with failure probability 
0.15. The red line marks the capacity of the channel (roughly 
1.31 bits/channel use) as given by the formula of Bennett, 
Shor, Smolin and Thapliyal [9]. 
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A. Problem formulation and summary of results 

As usual, a quantum system Q is associated with a 
Hilbcrt space Hq. By the dimension of the system 
dim(Q) we mean the dimension of the associated space, 
and this work deals only with finite-dimensional systems. 
By a state of Q we mean a density (i.e. positive, trace 
one) operator on Hq. We denote the set of all such den- 
sity operators by V(Hq). 

By an operation with input system A and output sys- 
tem B we mean a linear map from T>(Ha) to T>(Hb) 
which is completely positive and trace preserving. We 
denote the set of all such operations by ops(A — » B). 

It is convenient to assume that every quantum system 
Q comes equipped with a canonical orthonormal basis 
which we call the classical basis, and whose members we 
denote by |z)q for i = 1, . . . , dim(Q). 

As usual, a use (or uses) of a quantum channel with 
input system A and output system B is represented by 
an operation £b|a G ops(A — > B). 
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FIG. 2: An entanglement-assisted code Z transmitting a 
message W chosen by a source 5, via a channel use £ . The 
average channel input induced by the source and encoding is 

pa = £™=i 5 MpHa. 

Definition 1. In an entanglement-assisted code of 

size M, the sender and receiver have systems Ae and Be 
in an entangled state fA E B E , and for each message w € 
{1,2,..., M} there is an encoding operation C(w) a\a e € 
ops(AE —> A). Following the use(s) of the channel, the 
decoder performs a POVM Z?b e b on BeB to obtain the 
decoded message. 

Definition 2. An unassisted code can be viewed as 
a degenerate case where the decoding measurement op- 
erates only on the channel output B. Since Be is com- 
pletely ignored, there is no loss of generality if we take 
Be and Ae to be trivial, one- dimensional systems. Then 
C{w) is completely specified by its constant output p(w)a 
on A. 

Figure [2] illustrates an entanglement-assisted code Z 
transmitting a message W chosen by a source S via a 
channel use £b|a- The message W and the outcome W 
of the decoding POVM are classical random variables. 
The source is specified by the probabilities 

S(w) := Pv(W = w\S). 



The probability of error (which depends on the source, 
code and channel) is 

Pr(W ^ W\£,Z,S). 

For an integer M let Sm denote a source with M 
equiprobable messages i.e. Sm{w) = 1/M. 

Definition 3. We call a size M code Z an (M, e, pA.) 
code for £, if Pr(VF ^ W\£, Z,Sm) < e, and its average 
input state is pa- 

We denote by M c c (£b|Ai Pa) the largest M such that 
there is an (M,c,pa) code in class C for £b|a- If C = 
E then the codes can be entanglement-assisted; if it is 
omitted, we only allow unassisted codes. 

Remark 4. Clearly, the largest size of any code in C 
with error probability e (which we denote by M £ c (£b|a)) 
is just max PAeP(WA ) M e c (f B |A, Pa)- 

In [S] Polyanskiy, Poor and Verdu showed that many 
existing classical converse results can be easily derived 
from a finite blocklength converse which is obtained by 
a simple and conceptually appealing argument relating 
coding to hypothesis testing on the joint distribution of 
the channel input and channel output. 
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FIG. 3: The quantum hypothesis testing problem which ap- 
pears in our bounds. 



Our key results (Theorem 14 and Theorem 15) are 
generalisations of the PPV bound to quantum channels. 

As shown in Fig. [3] the hypotheses specify quantum 
states of a bipartite system AB, where B is the out- 
put system of £b|a an d A is isomorphic to its input 

system. Define a canonical purification of the average 

i i 

input state p A by Vaa : = Pa^aaPa' where $ A A := 
^dim(A) |i) A |i) A (j| A (j| A . Hypothesis H is that AB is 

in the state [p;£]ab : ~ ^biaIV'aa] obtained by acting on 
this purification with the channel £b|A: whereas hypoth- 
esis Hi is that the state of system B has been replaced 
by ctb, resulting in the product state p^ctb- 

In section |TT] we obtain converses for both 
entanglement-assisted and unassisted codes as a function 
of the minimum type II error /3 :— Pr(accept Hq\Hi,T) 
(i.e. the probability that the test T incorrectly accepts 
Hq when actually Hi was true), for tests T which have 
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type I error a := Pr(accept Hi\Hq,T) no greater than e, 
and which can be implemented by operations in a class 
f2 which depends on the class of codes: 



^ 2 ([p;^]abIIpa^b) 



(1) 

i"A i !! iI/>;% B ,^ B ) (2) 

{T^abPa^b : TrT AB [p;£] AB > 1 - e} 

(3) 

where T°(A, B) is the set of all tests (POVM elements) 
on the system AB which can be implemented by opera- 
tions in the class $7, (what this means is defined in Sec- 
tion IC). The class ALL only demands that T AB be a 
valid POVM element (0 < T AB < 1). By generalising the 
construction in [5] which takes codes for classical chan- 
nels to classical hypothesis tests to a construction which 
takes entanglement-assisted codes to quantum tests (i.e. 
POVM elements), we show that log M £ E (£ B | A , p A ) 
mhio-g D e ([p; £ } AB \\p\ ob), and hence that 



< 



logMf(£ B | A ) < maxmmD £ ([p;E] kB \\p\a B ) (4) 

PA OB ^ 

(where the omission of the test class superscript means 
that it is ALL). When the channel is classical this bound 
reduces to the PPV converse. 

The class L of local tests corresponds operationally to 
those which can be implemented by classical hypothe- 
sis testing on the joint outcome of local measurements 
(possibly correlated by shared randomness) on the two 
subsystems. Since our construction is shown to take 
unassisted codes to local tests, we obtain the bounds 
logM e (£ B | A ,p A ) < min^ D^([p; £} AB , p* A <r B ) and 

logM e (£ B | A ) < maxminD^([p;£] AB ,/^cr B )- (5) 

PA CB 

The bound Q for entanglement-assisted codes has a 
number of desirable properties: 

1. It is asymptotically tight for memoryless chan- 
nels. In common with the bound of Datta and 
Hsieh [1], analysing the large block length be- 
haviour of the bound for memoryless channels re- 
covers the converse part of the single-letter formula 
for entanglement-assisted capacity proven by Ben- 
nett, Shor, Smolin and Thapliyal [S], as we show in 
subsection MIDI 

2. Generalising results of Polyanskiy [7] we show that 
D e ([p;£] AB \\p*ra B ) is concave in p A and convex in 
a B . This enables one to use symmetries of the chan- 
nel to restrict the optimisation over p A and a B to 
states with corresponding symmetries, as we show 
in subsection MI El 



3. In subsection III B we give an explicit formulation 
of the bound as semidefinite program (SDP) which 
is a natural generalisation of the linear program 
(LP) given in [5] for the PPV converse. 



In subsection III C| the Wang-Renner bound is shown 
to be equivalent to making the (sometimes suboptimal 
[TJ) choice a B = £ b \a[pa] and taking Jl to be the class of 
operations LCI which can be implemented by local op- 
erations and one-way classical communication from Alice 
and Bob. 

Since the Wang-Renner bound is asymptotically tight 
for the unassisted capacity (and even for the product 
state capacity, thus recovering the HSW theorem), the 
stronger bound ^ also has these properties, but is oth- 
erwise less attractive, as it lacks an SDP formulation and 
does not possess the convexity property mentioned above 
[10] . However, the formulation in terms of restricted hy- 
pothesis testing makes it clear that by moving to less re- 
strictive conditions on the test, we might obtain weaker, 
but more tractable bounds. 

When 1~2 is LCI, or the larger class PPT (of oper- 
ations which remain completely positive when preceded 
and followed by partial transposition ) , the convexity 
property does hold (see Theorem 19), as does the sym- 
metrisation argument. 

For PPT the bound is given by an SDP (see subsec- 
tion IIIB). It seems unlikely that the PPT bound is in 



general, asymptotically tight, but it might prove useful 
for certain channels. 

To demonstrate the use of our bound on entanglement- 
assisted codes, we show in section |TV| how to evaluate it 
exactly for depolarising channels. We also discuss the 
relationship of the work to existing results on strong 
converse bounds for quantum channels, and to security 
proofs in the noisy-storage model (section [v]). 



B. Classes of operation on bipartite systems 

By a sub-operation we mean a trace non-increasing 
completely positive map. Let subops(A — > A') denote 
the set of all sub-operations taking states of the system 
A to states of the system A'. 

Similarly, let subops r! (A A',B ->■ B') (ops r! (A -> 
A', B — > B')) denote the set of all sub-operations (opera- 
tions) taking states of the bipartite system AB to states 
of A'B', which belong to class O. 

A sub-operation £ A 'b'|ab € subops(A — > A',B — > 
B') belongs to the class PPT if it is positive-partial- 
transpose preserving, i.e. if t B ' o C o t B is completely 
positive, where t denotes the transpose map. £ a <b'|ab 
belongs to LCI if it can be implemented by local opera- 
tions and one-way classical communication from Alice to 
Bob. This means it can be written in the form 



C 



A'B'|AB 



(a) ^.(a) 



|A^B'|B 



where £ a ^ A "j A e subops(A -> A') and £> B "j B G 
subops(B — > B') for each a. Throughout, we will omit 
tensor products if it is clear which system the operations 
act on. £ A 'b'|ab belongs to L if it can be implemented 
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by local operations and shared randomness, which means 
it can be written 

£a'B'|AB = ^2 Pr ^ a]a V B'\B 

r 

where 7"a'|A e subops(A -> A') and 2?^ B € 
subops(B — > B'). These classes are all closed under 
composition of compatible sub-operations. They are also 
closed under convex combination. Furthermore, they 
form a hierarchy PPT D LCI D L [331 02]. 

Let M (A, B) denote the set of all operations 
Mq\ab G ops(AB -> C) of the form 

dim(C) 

X C |AB- E |fc)(fc|cTr A , B '4 fe 'B'|AB 
fe=l 

where C is a finite-dimensional system acting as a classi- 
fy) a / 
cal resgister, and where each £\'B'|ab e subops (A — > 

A'. 15 ^15';. 

C. Quantum Hypothesis Testing with Restricted 
Measurements. 

In a classical hypothesis testing problem (with simple 
hypothesis) there are two hypotheses H and Hi of the 
form is 'The random variable R has distribution PW'. 
A statistical test T can be specified by the giving the 
probabilities T{r) — Pr(accept Hq\T, R = r). 

The 'type-I error' of T is 

a (p(°\T) =Pr(accept H±\H ,T) (6) 
=l-£p(°)( r )T(r) (7) 

r 

while the 'type-II error' of T is 

P{P {1) ,T) =Pr(accept H \H U T) (8) 
= J2P (1 Hr)T(r). (9) 

r 

Definition 5. 

/3 e (p(°)||p(i)) :=m in £ (pW ( T) (10) 
subject to 

a(p(°),T)<e, (11) 
Vr : < T(r) < 1. (12) 

In a quantum hypothesis testing problem (with simple 
hypothesis) there are two hypotheses Ho and Hi of the 
form is 'The state of system Q is Tq . In order to distin- 
guish between these situations it is necessary to perform 
a measurement Q, the outcome of which is then subjected 
to a classical hypothesis test. 

We extend Definition [5] to the quantum case, first to 
both diagonal in the classical basis by the trivial 
identification of such states with distributions, and then 
to the general case by 



Definition 6. 

/SJViSM) :=inf/3 £ (A4 C | A B[r| i]||Xc|AB[r|^]) (13) 
subject to 

M c \ab GM"(A,B). (14) 

and D? (r AB ||r«) := -log/?f (t a ° b ||t ab ). 

A trivial but very useful result is the following gener- 
alised data processing inequality. 

Proposition 7 (Data processing inequality). If fi 

is closed under composition, then for any operation 
A/a'b<|ab G ops n (A -> A',B -> B') 

-Dp(A/A'B'|AB [ t ab] ll-^A'B'|AB [ t ab]) < B?(Tj®\\TW). 

Corollary 8. If there is also an operation A^ B i A , B , € 
ops n (A' -> A, B' -> B) such that A/ AB | A , B , o 

■A/a'B'|ab[7ab] = Tab and -^abia'B' ° -^a'B'iab^ab] = 

T AB fh en 

Z3p(A/' A 'B'|AB [^AbIII-Va'B'IAB [ t Ab]) = ^f(^ll^)- 

Since the states of C in Definition [6] are classical, there 
is an optimal test which measures in the classical ba- 
sis. Supposing that it has probability pk of accepting 
H when it measures k, then the probability of the test 
accepting Ho when the state of AB is r AB is 

^PfcTr A 'B'^A'B'|AB[ r AB] =Tr A 'B'^A'B'|AB [TAb] (15) 
k 

=Tr AB T AB [r AB ] (16) 

where C = Y,kPkC {k) G subops" (A ->■ A',B B') 
by convexity, and T A b = Yli M(iy M(i) where M(i) are 
Kraus elements for C. Tab is the POVM element (which 
we call simply a 'test'), of the outcome of a measurement 
on AB which can be implemented by operations in Jl 
which corresponds to the acceptance of hypothesis Hq. 
We denote the set of such tests on AB by by T n (A,B). 

Proposition 9. 

/^abH^) := inf Trr AB T AB (17) 
subject to 

Ttt^Tab >l~e, (18) 

Tab G T r2 (A, B). (19) 

Proposition 10. Tab G T ppt (A,B) if and only if < 
Tab < A ab and < t B [Tab] < ^ab where t B is the 
transposition map on system B. (See 1131.) 

Remark 11. For the classes of operations considered 
in this paper (ALL, PPT, LCI and L), T°(A,B) is a 
closed set, and so for these classes the infimum in Propo- 
sition [9] and Definition [6] can be replaced by a minimum. 
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II. FINITE BLOCKLENGTH CONVERSES 

In this section we generalise the proof of the PPV clas- 
sical converse (see Section III.E of [6]), to obtain the fol- 
lowing bounds, which are analogous to Theorem 27 of [BJ. 
To state them, let us first introduce two quantities. 

Definition 12. 

iii* i* 
[p;£} AB := £b|a[Pa*aaPa] = P\ £b|a[$ aa ]p a 

Definition 13. By analogy with the relationship be- 
tween the mutual information and relative entropy, we 
define: 

I?(£;p) :=rmnZ?p([p;% B ||pta B ). (20) 

u 

Theorem 14 (Entanglement-assisted converse). 

logM e E (£ B |A,/OA) <^(^b|a,Pa), and so (21) 
logM £ E (£ B |A) <max/ e (£ B | A ,p A ). (22) 

PA 

Theorem 15 (Unassisted converse). 

logM e (£ B |A)PA) <^(^b|A,Pa)j and so (23) 



logM e (£ B | A ) <max/ e L (£ B |A,PA)- 

PA 



(24) 



Corollary 16. From the hierarchy of operations, it fol- 
lows immediately that 

logM e (£ B | A ,p A ) </ PPT (£ B |A,/0 A ), and so (25) 
logM e (£ B | A ) <max/ PPT (£ B | A) p A ). (26) 

' PA 

Just as in 6J, these results are a consequence of a 
more general 'meta-converse', which relates the proba- 
bility of correctly sending information over the channel 
to the problem of hypothesis testing. To establish this, 
we first prove (using the notation established in Defini- 
tion [TJ 

Proposition 17. From any entanglement-assisted code 
Z and source S, such that the average input state is pA, 
one can construct a test T AB 6 T(A,B) such that 

Pr(W = W\E,Z,S) = Tr[p;% B T AB . 

Furthermore, if Z is unassisted, then T Ab G T L (A,B). 

Proof. Since it is always possible to augment Ae to 
A E 'A E , take C(w) A | Ae / Ab [-] := C(w) A | AE [Tr AE <(-)] and 
^Ae'AeBe a purification of \I/ Ae b e , we can assume that 
*a e b e is pure. 



Let the isometry [/(u>)ga<-a e be the Stinespring rep- 
resentation of the encoding map C(w) A | Ae , where G is 
the discarded environment system. In fact, we can take 
Ae = GA so that C/(w)ga<-ga =: U{w) G a is a uni- 
tary, anda complete C(w) A | Ae [-] = TrG-U(w)[-] where 

il(w)[-] := U(w) G a[ ■ }U(w) GA . Finally, there is no 




W 



FIG. 4: Reformulation of the protocol of Fig. [2] 



loss of generally in demanding that Be = GA = GA. 
This reformulation of the protocol of Fig. [2] is illustrated 
in Fig. [4] 

First note that 



^(w)ga* G aga C/ ( w )ga 

=M(w) GA $ GAG A M MgA 

1 

where M(w)qa '•= U(w)ga^qa^ an< ^ that 



(27) 
(28) 



pHga = M(w) GA M(w)l A (29) 

is the state of GA after encoding if the message W — w. 
Referring to the diagram, we see that 

Pr(W = w\W = w,£,Z) = TrD(w) eAB £H 5AB 

where 

ZHgab =Tt g £ bia [M(w) ga ^ gaga M(w)I a ] (30) 
=Tr G M( W )l A <P GG £ BlA {$ AA }M(w) GA (31) 



--M(w)l x l G £ BlA [^ AA ]M(z 



(32) 



Here we have used the easily verified fact that, for any 
linear operator M A on Ha, M A $ AA = Mj$ AA . The 
probability of successful decoding is 

Pr(W> = W\£,Z,S) (33) 

M 

= £ S(w)TrD(w) GAB £(w) GAB (34) 

w=l 

=Trl e ,S B | A [$ AA ] J R eAB (35) 



where 



M 



Rgab ■= E S(w)M(wy GA D(w] 



GAB 



M(w) 



T 

GA' 



(36) 



Since R, 



GAB 



is given by a completely positive map 



(with Kraus operators S (w) M (w) GA ) acting on D 



which satisfies Df< 



'GAB' 



GAB 



^ ^GAB' we have ^ ^GAB ^ 



P* ga ^b, where p GA = Y,w=i S ( w )p( w )ga is the average 
state of GA after encoding. Therefore, 

-i* -i* 

T AB '■= P~A R AbP a 2 ( 37 ) 



6 



satisfies < T AB < 1 AB , and 
Pr(W = W\S,Z,S) =Tr AB 4*£ B |A[<fAA]Pl* T AB (38) 



If £ B X has £^' a [pa] — o B for all pA, and S = Sm 
(i.e. the M messages are equiprobable), then it is easily 
verified that 1 — €\ = 1/M (in fact any other value would 
imply that communication is possible in the absence of a 
channel.) 

Setting £ B ^ A = £b|Aj and maximising over a B , we see 
that any {M,e,px) code whose corresponding test be- 
longs to T n (A, B) must satisfy 



A I 1 A 
= Tr AB [P'i £\ Ab^Ab 



(39) 



as promised. 

As noted in the caption for Fig. [2j any unassisted 
quantum code corresponds to restricting Bob's decoding 
measurement to the output system of the channel, so that 

Ww 6 {1, ... , M} : D(w) 6lB = l &A D(w) B . 



Substituting this into (361, we see that 



M 



T AB = ]T E(w) A D(w) B 



w=X 

where the positive operators 

E(w) A := S(w)(pl)-i (Tr & p(w)l A ) 
satisfy 

M 



max/^(([p;£] AB) p A <7 B ]) < 1/M. 



(41) 



For entanglement-assisted codes, rearranging this and us- 
ing Definition [6] gives us 

logM e E (£ B | A ,/9 A ) < mmD([p;£] AB \\p* A a B ) 



(Theorem 14) and for unassisted codes, we can write the 
stronger, 

logM e (£ B | A ,p A ) < min£> L ([p;£] AB ||p A <7 B ) 



(which is Theorem 151 



Letting £7(0) A := 1 — p~ 2 * p\p~> 2 * , the operators 



A^A 

E(0) A , . . . ,E(M) A constitute a POVM, and so, for an 
unassisted code, T AB is a local test i.e. T AB e T L (A,B). 

□ 

A quantum generalisation of the 'meta-converse' The- 
orem 26 of [BJ, is now straightforward: 

Proposition 18 (Meta-converse). Let Z be an 
entanglement-assisted code which, when used with S, in- 
duces the average input state pa, and which has success 
probability 

Pt(W = W\£®, Z,S) = l-ei, 

when used with channel use £ B \ A ■ Consider the hypothe- 
sis testing problem where Hi asserts that the state of AB 
is [p;£ ] AB - If we accept Hq when the test constructed 
from {Z,S) as in Proposition 17 accepts, then 



f3 = Tr[ P ;£W] AB T AB = l-e 1 



and 



1 



Tr[p;£:( )] AB T AB = l-£o. 



Therefore, 



/3 e o([p;f (0) ] AB ll[p;f (1) ] AB )<i-^- 

Furthermore, if constraints on the code Z mean that T AB 
is guaranteed to belong to some class of tests T n (A,B), 
then the (potentially more stringent) bound 



/3"([p;^ (0) ]abII[p^ (1) ]ab)<i-^i 



(40) 



also applies. 



III. PROPERTIES OF THE BOUNDS 

The results of Polyanskiy [7J, show that when con- 
strained to classical states p A and classical channels £^°\ 
£«, the quantity /? e ([p; £^} AB , \p; £ W ] AB ) is jointly 
convex in p A and e. Here we generalise the proof to 
quantum states and channels, even for restricted mea- 
surements ft = PPT or LCI. 

Theorem 19. For any CPTP maps and£ {1 \ and 
class of bipartite operations l~i which contains LCI, the 
function 

f n (e,p) :=P?(l£W;p} AB ,l£^;p} AB ) 
is jointly convex in e and p. 

Proof. First, note that f n (e,p) = f n (e,p*) where 

f n (e,p) := &(4>|W A ]4M $ AA/4]) 

so it will suffice to show the joint convexity property 
for / . Suppose we have ej,p(j) A , and Xj for j = 
l,...,m, with ^2j—iXj = 1. Let e = Ylj=i ^j e j an d 

P = EJli *jp(Jh- 

Let Tj be a test in T n that achieves f n (ej,pj), and 
let 

rn 
3=1 

This test can be implemented in the following way: 

First Alice performs an measurement with measurement 
i _ i 

operator \/XjP{j) A P A 2 for outcome j = 1, . . . m, and an 
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operator for outcome j = to make the measurement 
complete. 

Now, based on the outcome of this measurement j 
(which is classically communicated from Alice to Bob) 
Alice and Bob perform the measurement to implement 
the test T(j) AB (they accept Hq when the this test 
T(j) AB would have). 

Since this implementation uses only one-way classi- 
cal communication followed by operations in class f2, 
which, by hypothesis, contains LCI, this test is also in 
T"(A,B). The type-I error a for T AB is 

T 4>Ka4]7ab 



=Tr£ A jP ( J )|T(j) AbP ( J )|£^[$ AaJ 



J = l 



i=i 



e j ) = \-e. 



Similarly, the type-II error /3 is 



Therefore f n (e,p A ) < J2j A j7°( e ^ P{j)a)i as claimed. 

□ 

Corollary 20. I e (£,p) is jointly concave in e and p. 



A. Classical channels 

Let Ca\a and Cb|b denote the completely dephasing 
operations in the classical bases for A and B, respec- 
tively. If £b|a is classical then £b|a ° Ca\a 



£b|a, and 

£ B \ Al and we can therefore restrict the 
minimization over average input states in our bounds to 
states pa which are diagonal in the classical basis, thus 



Cb|b ° £b|a 



Pa 



J2p(x)\x)(x\ A . 



Furthermore, since the state [p;£]ab * s invariant under 
the operation Cb|b & ops L (A — > A, B — > B), for any class 
fl containing L we have (by Theorem [7| 

D?([p;£] ab \\p* a <tb) (42) 
>D?(C m [p;£] AB \\C m pla B ) (43) 
=D?([p;£] AB \\p A * B ) (44) 

where a' B is diagonal in the classical basis of B. There- 
fore, we can also restrict the optimisation over a B to 
classical states. Therefore, Theorems [l4| and [15] both re- 
duce to the PPV converse for finite alphabets when the 
channel is classical. 



B. Semideflnite programs 

Looking at Proposition [9] it is clear that /3 e is given 
by the solution of a semidefinite program (SDP). What 
is less obvious, is that I<:(£b\a, Pa) and its maximisation 
over pa can also be formulated as (simple functions of 
the solutions to) SDPs. We have 



^(£b|a,Pa) =minZ? £ ([p;£] AB ||p A cr B ) 

= - log max min Tr AB p*r a B T 



AB 



subject to the constraints 

Tr[p;% B r AB >l-e, < T Ab < 1 Ab 



(45) 
(46) 

(47) 



By von Neumann's minimax theorem, the order of the 
maximization and minimization in the equation ( 46 ) can 



be reversed. Then, using max ffBeI ,( WB ) Tr AB p A o- B T AB 
max CTBeX , (WB) Tr B CT B (Tr A p A T AB ) = WTr^tT^W^ 
min{A : A1b > TrAP A T AB } we have 



^(^bia,Pa) 



log min A 

T,A 



subject to 
A1 B >Tr A piT AB 



(48) 



(49) 



and the contraints (47). With the change of variables 

*i *i 

-^ab := Pa^AbPx i this is equivalent to 



Proposition 21 (Primal SDP) 

-M£b|a,Pa) : 



= — log min A 

T,A 

subject to 
Tr A i? AB < Ai B , 
Trf B | A [<i> A A]^AB >l-e. 
R AB < P* a £ b, 
R AB > 0. 



(50) 

(51) 
(52) 
(53) 
(54) 



Since the constraints on p A are semidefinite, the bound 
max h{£ B \ A ,p A ) 

Pa62?CH a ) 

from Theorem \lJ\ is also a semidefinite program. 

Remark 22. The constraint < tBT As < 1 (where t B 
is the transpose map on system B) is equivalent to 

o < t B i? AB < p a i b . 

Because the transpose map is linear, adding these con- 
straints on -R Ab to the primal SDP above yields an SDP 
for/f PT (£:B|A,PA). 

Associating operators Fz B and G B with constraints 



(53) and (51), and a real multiplier p with (52) yields 



the Lagrangian 



A 



TrG B (Tr A i? AB - A1 B ) 
+TrF AB (R AB -p* A ®t B ) 

+fi(l-e-TrR- AB £ m [^ AA ]) 
=Tri? AB (H A ® G B + ^b - ^b|a[* Aa ]) 
+A(1 - TrG B ) + (1 - e)fi. 

from which one can derive the dual SDP. Below, we show 
that the optimal value of this dual SDP is equal to the 
optimal value of the primal. 



Proposition 23 (Dual SDP). 

^(£b|a, Pa) = - logmax(l - e)/i - TrF A p* A 
subject to 



$G B +F AB > //£ B | A [$ AA ], 
TrG B < 1, 

g b ,f Ab ,v>o. 



(55) 

(56) 
(57) 
(58) 



Proof. For sufficiently large a, the point given by Gb = 
lB/(2dim(B)), F AB = al^ R , a nd any p, > strictly 
satisfies the dual constraints ( 56]|58 ), so the dual SDP is 
strictly feasible, and therefore its solution is equal to the 
primal solution (see Theorem 3.1 of [14]). 

□ 



The maximisation of ( 55 ) over states p A of A can also 



be formulated as an SDP, in a similar way to the primal. 



C. Comparison with Wang— Renner 

In our notation, the Wang-Renner converse states that 
for c-q channels with finite input alphabet A and output 
states t(x)b for i£A 

logM e < sup AsOcb 1 1 t c t b ) 
p 

where C is a system of dimension |A| and tcb := 
^2 xeA p{x)\x){x\c ® t{x)-q. To apply their converse to 
general channels, one notes that any (unassisted) code 
for a general quantum channel, induces a c-q channel by 
its specification of the input states used in the code. To- 
gether with the choice of p the yields a c-q state 

tcb = ^2p(x)\x)(x\ c ® £b\a[p(x)a] 

X 

Optimising the Wang-Renner bound over all choices of 
input states and distributions p such that the average 
channel input to the quantum channel is p A , one obtains 



M e (£ B \ A ) < Xe(£ B |A,PA) 



Definition 24. 

Xe(^B|A,PA) : 

and 



max -D £ (£ B |a?7ca||?7c£b|aPa)- 

t)CA6Ens(p A ) 



Definition 25. Let Ens(pA) be the set of all states of 
the form 

dim(C) 

Pk\k)(k\ c ®p(k) A 

fe=i 

where C is some finite dimensional system (acting as a 
classical register) and where pk > 0, J2kPk — 1) p(^)a € 
V(H A ), and 



^2pkP(k) A = 



Pa- 



This notation is motivated by the fact that the Holevo 
bound [15] is given by 

x(£b|a,Pa)= max £>(£b|a?7ca||»7c£b|aPa), 

»)CAeEns(p A ) 

where D is the usual quantum relative entropy. 
Proposition 26. 

X £ (£b|a,Pa) = £ £ lc1 ([p;%bIIp a £b|a[pa]) 

Proof. Let Alice's measurement have the POVM ele- 
ments F(k) A where k labels the outcome which she sends 
to Bob. There is no loss of generality in having Alice per- 
form before Bob does anything, and storing the outcome 
in a classical register C to which Bob has access. 

Letting p(k) := Trp* A F(k) A be the probability of out- 

i i 

come k and p{k) A = Tr A (p A , 5 AA /9 A )i 7 A (fc), under hy- 
pothesis the state of CB is ^bia^cA; where 



VGA 



while under hypothesis 1 the state is ^c^bia^a- Clearly 
VA = J2kP( k )p(k) A = PA- Therefore, 



^ lc1 ([p;£]abIIp a £b|a[pa]) 

max 

t/ca£Eiis(pa) 
=Xe(^B|A,PA) 



max D £ (£ b \ A i]ca\\vcSb\aPa) 

i)CA6Ens(p A ) 



□ 



where 



Corollary 27. From the above proposition and the def- 
initions of the quantities involved, the inequalities 

^ L (^B|A,Pa) < -f £ LC1 (£B|A,PA) < Xe(^B|A,PA) 

follow immediately. 
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D. Asymptotics 

It was already noted in [2] that the (asymptotically 
tight) Holevo bound on unassisted codes can be recovered 
from an asymptotic analysis of the Wang-Renner bound, 
which our bounds on unassisted codes subsume (in fact, 
an argument of j5] can be used to show that the converse 
part of the HSW theorem [T5], [T7] can also be derived). 

For entanglement-assisted coding over memoryless 
quantum channels, Shannon's noisy channel coding theo- 
rem has a beautiful generalisation due to Bennett, Shor, 
Smolin and Thapliyal: 



lim lim -logM e E ((£^V|A») = max/(£ B |A; Pa) 

e— >0 n— >oo n pa 



(59) 



where 



I(£b\a\ Pa) ■= S(p A ) + S(£ b \ A (pa)) - S([p; £} AB ) 

is the quantum mutual information between systems A 
and B when the state of AB is [p;£] AB . As noted, for 
classical channels Theorem [14] reduces to the Theorem 
27 of [6]. In section III.G of 6] it is shown how to derive 
a Fano-type converse from this theorem. The deriva- 
tion and result generalise perfectly to the entanglement- 
assisted codes for quantum channels: As usual, the bi- 
nary entropy is h(p) := —(1 — p)log(l — p) —plogp, and 
the binary relative entropy is 



d(p\\q) :=D((p,l-p)\\(q,l-q)) 

=p\og - + (l-p) log - — - 
q 1-q 

>p\og- + h(p). 



(60) 
(61) 

(62) 



By the data processing inequality for quantum relative 
entropy under CPTP maps, and (62) 



D(po\\pi) >d(l-e||&(pO,Pi)) 
>(l-e)log ' 



He). 



(63) 
(64) 



Therefore, 

A(pollPi) < (D(po\\pi) - h(e))/(l - e). (65) 
i i 

Setting p a = £ B |a[Pa $ aa/°a] and p x = p\o B , and mini- 
mizing over <tb yields 

Lemma 28. 

/ e (£B|A, PA) < {I(£b\a; Pa) ~ h(e))/(l - e) (66) 

The converse part of this theorem is easily derived from 
the previous lemma, which tells us that 

Alo g Mf (g )<lmax / ^" ; ^)-^. (67) 
n n pa 1 — e 



In [TS] Adami and Cerf show that 

/ (4 1 1 ) |A 1 4 2 2 ) |A2'^ A i A ^ = J (4 1 1 ) |A 1 'PA 1 ) + i"(4, ) |A 3 >A'A 3 ), 

so we have 



max/((£" 8,n ) B «|A",/OA") < nm&x I(£ b \a, Pa), 

PA" PA 



and 



lim -logM e ((f^) B „| A „) < J— ma xI(£ BlA ;p A ), 
n— >oo n i — e p 

(68) 

and taking the limit e — > completes the proof. 



E. Using symmetries 

We now show how symmetries of £b\a can be used to 
simplify the computation of /"(£ B |A> Pa)- For classical 
channels, analogous results were obtained in [7J (but note 
that [7J also deals with infinite input /output alphabets) 
and similar ideas discussed in [5]. 

Suppose that there is a group G with an action g A (<7a) 
on states of A (and of A) given by 

9 at A = U{g) A T A U(g) A , 

and an action g B on states of B 

5bt b = V(g) B T B V(g) B , 

where U and V are unitary representations of G, such 
that 

Vg e G : £ B \ A (g A p A ) = <?b£b|a(pa)- 
Proposition 29. For any f2 D L and for all g G G, 

D?([£]g A p} AB \\g* A p A g B o- B ) = D?([p;£] AB \\p A <7 B ) (69) 

and I"{£ b \A,9aPa) = I"(£b\a,Pa)- 
Proof. The first claim follows from 

[£;9ap\ ab 

=£ BlA [U(g) A p A U(g) A $ AA U(g) A p A U(g) A ] 

=£B\ A [U{g) APA U{g)\® AA U{g)lp A U{g)i] 

=V(g) B U{g)\£ B \ A [pi$ AA p A )V{g) B U{g)l 

=9* a 9b[p;£] ab , 

the fact that g*g B , {g^TAg^h e L(A -> A,B -> B) C 
f2, and Corollary [8j We use this to prove the second 
claim thus: 

I? (£b I A , Pa) = min ( [p; £ ] AB \ \ p A a B ) 

=D?([p;£] AB \\p A o- B ) 
=D?([£; g A p] AB \\g A p A gBO- B ) 

> min Dp ([£; g A p] AB \\g A p A <J B ) 
=I?{£b\a,9aPa)- 
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Since g has an inverse in G, the reverse inequality holds 
too. 

□ 

Suppose that there is a Haar (G-invariant) measure p 
on G, and let pa '■= Jq <?A/°am(s) ■ Then, pA is invari- 
ant under the action g^, and by Jensen's inequality and 
Proposition [29} 



and for n uses the operation is 



^(£b|a,Pa) 



> 



h(£B\A,9ApA)n(9) = 4(£b|A,Pa) 



Therefore, the optimisation over pa can be restricted to 
those density operators invariant under the action of ^a- 
An important type of symmetry that an operation rep- 
resenting n channel uses may possess is permutation co- 
variance. For example, this applies to channels of the 
form £® n . 

For any element u of the symmetric group S n , and 
n-partite system Q n :— Q1Q2 ■ ■ ■ Qn consisting of n iso- 
morphic systems Qj, let ttq™ £ ops(Q™ — > Q n ) denote 
the unitary operation which permutes the n systems. 

An operation £b«|a™ £ ops(A" — > B n ) is permutation 
covariant if 

£b™|A« 1" A™ = ""B" £b»|A»- 

Suppose that, in addition to permutation invariance of 
the n uses, each use of the channel is G-covariant in the 
sense that 



£ 



B"|A' 



9a™ = 9b- ° £ 



B"|A" 



(1) (2) (n) . ,. t 

9 a, 9a 2 '■ ■ ■ 3a„ 1S an action of g = 
€ G x ", where we demand that the 



where gA™ 

(5 (1) ,ff (2) ,---,3 
individual <?Aj are ah the same, in the sense that, if xa^Aj 
is the permutation which swaps systems Aj and A.,- , then 
XA,A 3 o g Aj o xa s a, = 9Ai for alii, j £ {1, 2, . . . , n}, (and 
similarly for <7b»). 

To every ordered pair (w, g) where ir £ S n and g £ 
G x ™, we can associate an action (vr, g)A" '■— n A™ gA™- 
Under composition these actions constitute a group, 
which is a semi-direct product of G xn and S n (G x ™ be- 
ing the normal subgroup) which we denote S n k G xn . 
Dchning the action of (tt, g) € S n K G xn on states of B" 
by (tt, g) B « := 7TB" o gB™, we have 



£ B n\ A n ° (TT, g) A n = (vT, g) B " ° £] 



B"|A™- 



7-) (gin 
■^B-IA" 



i|Ai 



.2? 



B„|A„, 



which has the covariance group S n x \J(d) xn . 

The only input and output states with the correspond- 
ing invariances are the maximally mixed states. There- 
fore, 



sup/ £ (2>® n 



B n I A™ 



PA n ) 



AW(p)|: b JIma«mb») 



where (f>(p) AB '■= ^biaI^aa/^ = C 1 -p) $ AB/ d +PWB 
is an isotropic state. Since the arguments of D e com- 
mute, this is equivalent to a classical hypothesis test be- 
tween the distributions given by the eigenvalues of the 
two states. In fact it is equivalent to deciding between 
hypotheses on the distribution of n samples of a binary 
variable: Hypothesis H$ is that the samples are drawn 
i.i.d. with probability (1 — p) + p/d 2 of being 0, and 
hypothesis H\ is that the samples are drawn i.i.d. with 
probability 1/d 2 of being 0. Preparing system AjUj in 
state &{xj) produces pi when Hi is the case. Conversely, 
setting Xj to if $(0) is measured on system AjBj and 
setting it to 1 otherwise, yields Hi when the state is pi. 
So, by Corollary g| 

A(^)f: B J|MA^B») 
=A((M,l-M)® n ,(A,l-A)®") 



31 



where p = (l—p)+p/d 2 and A = 1/d 2 , and Proposition 
gives a formula for this quantity which is easy to evaluate 
exactly (as we have done for Fig. [I]). 

Proposition 31. Let p > A be two probabilities. 

MM, 1 - pT n , (A, 1 - Xf n ) = (1 - 7)&( £ ) + 7%)+i 
where 



Oil 



3=0 

n 

E 

3=1 



p?{\-p) n -i, 



A J '(l-A) n - J ', 



(70) 
(71) 



1(e) is the value of I satisfying a? < e < 0^+1 and 7 = 
(a - a e{a ))/(a e ( a)+1 - a t{a) ). 

Proof. This is just optimising over the optimal (classi- 
cal) hypothesis tests identified by the Ney man- Pearson 
lemma. The same expression is given in [6]. 

□ 



IV. EXAMPLES 



Example 30 (The depolarising channel). A single use of 
the d-dimensional depolarising channel with parameter p 
and d-dimensional input and output systems A and B 
has the operation 

£>b|a[ta] = (1 -p)r B + pTr(r B )p B , 



V. STRONG CONVERSES AND 
CRYPTOGRAPHY 

In [19] a strong converse was proven for the classical 
capacity of many quantum channels j\f, including depo- 
larising noise. In particular, it was shown that for any 
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rate R strictly above the capacity of such channels, there 
exists a j(R,J\f) > such that the success probability 
of transmitting Rn uniformly random bits through TV"®" 
decreases exponentially in n 

P S ucc(^",-R) <2~^ R '^ n , (72) 

where the so-called strong converse parameter is given by 



7 (i?,A0 



max | 1 

Ka<2 



(R-X* a W) 



(73) 



with XaC^O the a-Holevo quantity [T!5], and 



P sacc (Af® n ,R) = j max 



J2 Tr [M x M® n { Px )] 



x€{0,l} nR 



where the maximisation is taken over all encodings p x 
and decoding POVMs {M x }x- Note that for rates above 
the capacity, we thus have that for any fixed possible e 
that e > i-2-f (R ^'> n . Hence, for any e we can solve ([73]) 



to yield an upper bound on the rate R, and thus logM e , 
as 



log M e < 



1 



log(l-e) + X ;(A0 



(74) 



for any a <E (1,2]. Note, however, that the present work 
is of a fundamentally different nature in that we provide 
bounds for log M e for any rate, even below the capacity. 

The results of |T9j had a nice application to proving se- 
curity in so-called noisy-storage model of quantum cryp- 
tography j^Hj. In short, this model allows for the secure 
implementation of any two-party cryptographic task un- 
der the assumption that the adversary's quantum mem- 
ory is noisy. Examples of such tasks include bit com- 
mitment and oblivious transfer, which are impossible to 
achieve without assumptions [21] [22] . Whereas more re- 
cent work also allows us to link the security to the en- 
tanglement cost [23] or the quantum capacity [23], we 
here refer to a security statement in terms of the classi- 
cal capacity. Specifically, it was shown in [20] that for 
any c-q state pxq , where Alice holds X and Q is held by 
an adversary 

H min (X|JT(Q)) > - logP 8UCC (JT, LH min (X)J) . (75) 

That is, by understanding H m j n (A) alone and the proper- 
ties of the channel T we can bound the adversaries knowl- 
edge about a string X of length k. In [20] a bound on 
Hmin(^") > k/2 was obtained by an uncertainty relation 
for BB84 measurements that were used to generate the 
string X , up to a security error e. Different measurements 
lead to higher (or lower) values of H m i n (A)/fc =: R. The 
strong converse for the channel T = M® n was then used 
to bound the r.h.s. for the rate R = (kR)/n, as long as 
R exceeded the capacity. 

However, up to this date no security statements were 
known for arbitrary J-, i.e., channels that may not obey 
a strong converse or simply structureless channels. Let 



us sketch that our approach directly leads to a security 
statement for any T that can be bounded using a semidef- 
inite program (although it should be noted that this is 
not easy to evaluate if the input dimension of T is large) . 
Yet, this approach leads to security even without the use 
of a strong converse for any J 7 . More specifically, we will 
turn things around and ask how large we have to choose 
k such that sending kR bits through the channel T will 
incur an error of at least e. Intuitively, our goal is to 
effectively overflow the adversary's storage device T . By 
the results of [2D] we can then bound the adversary's min- 
entropy as H min (A| J°(Q)) > — log(l — e). For any e > 0, 
our analysis yields a bound on log M e stating that if we 
were to transmit more than kR > logM e bits, the error 
is necessarily e + 5 > e for some S > 0, which yields the 
desired bound. As R is determined by an uncertainty re- 
lation, we thus know how many transmissions k we have 
to make to obtain security. 



VI. CONCLUSION 

We have shown how a simple and powerful idea [5] 
for obtaining a finite blocklength converse for classical 
channels in terms of a hypothesis testing problem can 
be generalised to quantum channels and entanglement- 
assisted codes. 

This generalisation has the property that a natural 
restriction on codes (removing entanglement-assistance) 
translates into a natural restriction on the tests that can 
be performed in the hypothesis testing problem (they 
must be local). This provides a strong link between the 
extensively studied problems of channel coding and hy- 
pothesis testing (and state discrimination) of bipartite 
systems under locality restrictions. 

Many avenues for further work are apparent to the au- 
thors: Subsection III E invites a more thorough investiga- 



tion into the extent to which symmetries can be used to 
simplify evaluation of the bound in various special cases, 
and especially in the case of general memoryless chan- 
nels, where one might hope for an exponential reduction 
in the size of the SDP, in a quantum generalisation of [8] . 

While we have been able to fit the existing converse of 
Wang and Renner [2] precisely into our hierarchy bounds 
based on restricted hypothesis testing, it is not obvious to 
the authors what relationship exists between our converse 
for entanglement-assisted codes and that of Datta and 
Hsieh [4] . We would like to know what can be said about 
this, particularly in light of the achievability bound given 
in g]. 

A limitation of our work is that we only generalise the 
classical bound of [5J for the case of finite input/output 
alphabets (as our input /output systems have finite di- 
mension). To analyse the general case will require greater 
mathematical sophistication (see [7]), but would be desir- 
able given (for example) the interest in quantum gaussian 
channels [251. 
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